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PREFACE 


In studying the progress of students through our educational sys- 
tem it is highly important to extend the investigation over as long a 
period as possible. As the author points out in the introductory 
chapter, this bulletin is the final report on a study of a group of several 
thousand high-school seniors in Illinois who have been followed 
through college or until they discontinued their collegiate training. 
Hence the collection of data extended over a period of five years. Dr. 
Odell deserves special commendation for continuing the investigation 
for so long a period in spite of the difficulties that inevitably handi- 
cap one in such a project. 

The conclusions presented in this report as well as those given in 
the three earlier reports should be of interest to those who are called 
upon to advise high-school graduates with reference to attending college 
and to those who advise college students. Although it appears that 
prediction on the basis of the data considered in the present study in- 
volves large errors, it is only by ascertaining the accuracy of the pre- 
dictions that may be made on various bases that we can learn how 
to predict more accurately. The Bureau of Educational Research is 
glad to present this final report of an extended study as a contri- 
bution in the field of educational guidance. 


Watter S. Monroe, Director 
July, 1930. 


Digitized by the Internet Arc ni 
in 2023 
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PREDICTING THE SCHOLASTIC SUCCESS 
OF COLLEGE STUDENTS 


CHAPTER I 
INTRODUCTION AND STATEMENT OF THE PROBLEM 


The recent increase in college enrollment and two resulting prob- 
lems. Many persons, among whom the present writer? is one, have 
called attention to the fact that among the noteworthy recent educa- 
tional trends in this country has been the marked increase in enroll- 
ment, especially in that of secondary and higher schools. Although the 
rate of increase has not been so great for the past two or three years 
as it was for the period immediately preceding this time, nevertheless 
it is still sufficient to possess decided significance. This is shown by 
the figures given in the last “Statistical Survey of Education”? is- 
sued by the United States Office of Education, which shows that at 
present approximately 1 per cent of our whole population is enrolled 
in higher institutions, whereas shortly before the close of the last cen- 
tury only about one-fourth of 1 per cent was so enrolled. 

This rapid increase in enrollment has been accompanied by a decline 
in the purchasing power of the dollar and a general demand that the 
scope of education be enlarged. As a result of the combined effect of 
these three factors, it has become very difficult to secure the amounts 
of money necessary to provide adequate educational facilities for all 
those who wish to avail themselves of them. The difficulty of doing 
so is probably greater in the field of higher education than in any other. 

At least two outstanding problems or questions have arisen in 
connection with the situation just mentioned. One is that of whether 
or not higher institutions shall admit practically all persons who have 
completed secondary work and wish to enter. Most state-supported 
institutions have done so with little or no restriction, whereas most of 
those deriving their support from other sources have exercised varying 
degrees of selection among applicants for admission. In most cases 


1The discussion in this section is practically the same as that found in the correspond- 

ing portion of ae. : ho ; ; 
dell, C. W. “Predicting the Scholastic Success of College Freshmen,” University 
of Illinois Bulletin, Vol. 25, No. 2, Bureau of Educational Research Bulletin No. 37. 


: University of Illinois, 1927. 54 p. yee We 
Aa OW, cone Collese Gir Select Group?” University of Illinois Bulletin, 
Vol. 24, No. 36, Bureau of Educational Research Bulletin No. 34. Urbana: University of 


inoi yf, ANG) fo q . 
Mlinols, ies ine Ar. “Statistical Survey of Education, 1925-1926,” U. S. Bureau of Edu- 
cation Bulletin, 1928, No. 12. Washington: Government Printing Office, 1928. 13 p. 


i 


the policies adopted have not been based upon thoroughgoing studies 
of the problem, and in no case upon conclusive evidence as to what is 


best; therefore, what should be done is still an open question. The 
data availablet seem to justify the conclusion that those who now — 


enter college constitute a marked selection of all who graduate from | 
high school, but that among them are many individuals who are, ap- 
parently, unable to carry the usual type of college® work successfully. 


Therefore it seems desirable, perhaps even necessary, that if colleges — 


are to continue to maintain their present standards of scholarship, 
they must exercise some degree of selection among those who wish to 
enter. From this assumption there follows the question as to what is_ 


the most satisfactory basis for making this selection. Do any data 


that are available or may be obtained concerning high-school gradu- 
ates provide satisfactory, or even helpful, means of predicting success, 


especially scholarship in college? If so, which of these data have — 


it 


most value for this purpose, and how reliable are they? 

The second question referred to is that of providing for college 
students of different aptitudes and abilities. If there is not considerable 
selection at the time of entering college, many of those admitted will 
fail to do satisfactory work in certain subjects, whereas in other sub- 
jects they will do at least passing, and perhaps even better, work. It is, 
therefore, desirable that educational guidance be provided for such 
students. This requires that, in so far as it is possible, colleges deter- 
mine the subjects or courses in which students are most likely to suc- 
ceed and those in which they are most liable to fail. Even if entrance 
requirements are highly selective and many applicants for admission 
are refused entrance, there is still a place for educational guidance, 
although the necessity for it is less acute. Furthermore, there has re- 
cently been much interest in the matter of providing different types 
or levels of instruction within the same subject and in’ other ways 
adapting the educational opportunities offered to the individual differ- 
ences of students. This has been much more prominent in elementary 
and high schools than in colleges, but within the past few years a 
considerable number of the latter have been devoting serious attention 
to the problem. In this case, also, the less selective is college admission, 
the greater is the need to provide for differences among students. Even 
when admission is highly selective, however, those allowed to enter 
do not constitute truly homogeneous groups, so that there is still need 
for seeking satisfactory bases oi classifying them and of adapting 
subject-matter to differences in ability, aptitude, and so forth. 


*Odell, op. cite, p. 26 


5The term “college” oat be used in a general sense to include all types of institutions 
of higher learning. 
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The purpose of this bulletin. It is the purpose of this bulletin to 
present what will probably be the final report of a study and evaluation 
of some of the more easily available items of information which 
probably are of value in foretelling scholastic success in higher in- 
stitutions. This bulletin is the fourth of the series concerned with this 
study. The first® presented data having to do with the individuals dealt 
with by the study while they were still in high school. The second? 
reported some data bearing upon the question of the selection of col- 
lege entrants from all high-school graduates. The third® gave the re- 
sults of comparing freshman college records with the various items 
of information collected before college entrance. The present one con- 
tinues the study throughout four years of college work, or as great 
a portion of these four years as the individuals concerned remained 
in college or could be followed. The writer has attempted to determine 
not only bases of prediction for college success in general, but also for 
each subject carried by any considerable number of the college students 
included. The purpose of this bulletin may, therefore, be stated as 
being to present data which show the accuracy of high-school marks, 
intelligence test scores, and other data in predicting the persistence in 
school and the marks of college students. Furthermore, the same 
problem will be dealt with from the standpoint of predicting success 
in the last three years of college upon the basis of freshman college 
marks in addition to the other predictive data mentioned above. The 
problem will be attacked primarily: by the methods of simple and mul- 
tiple correlation, and the accuracy of predictions based upon the best 
multiple regression equations will be shown. 

This bulletin will not devote any attention to the numerous studies 
of others who have been attacking the same problem as the present 
writer. There are at least three reasons for not doing so. One is that 
what seemed to the writer the outstanding studies of this type were 
briefly summarized in connection with the report of this study that 
dealt with the prediction of success of college freshmen.’ The second 
reason is that others'® have prepared such satisfactory summaries of 
what has been done in this field that an additional one does not seem 


6Odell, C. W. ‘Conservation of Intelligence in Illinois High Schools,’ University of 
Tilinois Bulletin, Vol. 22, No. 25, Bureau of Educational Research Bulletin No. 22, Urbana: 


iversi inois, 1925. 55 p. Sabet aos 2 
LL ae ar gee elicee, Students a Select Group?” University of Illinois Bulletin, 


Vol. 24, No. 36, Bureau of Educational Research Bulletin No. 34. Urbana: University 


f inois, 1927. 4 : ‘ p Ff : 
oy Ocal, t W. Noledichag the Scholastic Success of College Freshmen,” University 


of Illinois Bulletin, Vol. 25, No. 2, Bureau of Educational Research Bulletin No. 37. Urbana: 
University of Illinois, 1927. 54 p. 

Odell, op. cit. ; 

For one of the most recent summaries, see: , a ae 7 

Woody, Clifford (Chairman). ‘Quantitative Measurement in Institutions of Higher 
Learning,” Eighteenth Yearbook of the National Society of College Teachers of Education. 


Chicago: University of Chicago Press, 1930, Chapters III and IV. 


CHAPTER II 


THE GENERAL PLAN OF THIS STUDY 


The initial collection of high-school data. The data used in this 
investigation concern a group of individuals graduated from several 
hundred high schools in the state of Illinois in 1924 and admitted to 
various institutions of higher learning in the summer or autumn of the 
same year. In the fall of 1923 all the four-year public high schools 
in the state were invited to cooperate with the Bureau of Educational 
Research in this study. The number that did so was 368, a few more 
than one-half of all those within the state, and the number of seniors 
included was about 12,300. The data secured concerning them con- 
sisted of their scores upon the Otis Self-Administering Test of Mental 
Ability, Higher Examination, Form A and the answers to the questions 
on an “Information Blank for High-School Seniors,” which called for 
the following information: 


IN DSER Ecce cessahetctae A tees ake Rape ee ee a IMU UE con ie ieee Lea 
SOS ee acae le ace ate Nec Bel ee Is ea SERN SRN OO a ee a Ee De A we 
IDs Qe TB ee ae En ah Ee ae ee Ce ie ere ee Oe Oe 
ING CHONMOEDLCMDehe lee O25 tee een erer ee Sens re eh A eee 
Neate DES CliG @ | een eee re tenes Pl Secs Se RE Ne) ee a te a 
TERRE CORE OTN a as RS ee ee ee ee eee ee 
Intentions concerning further education 
CERIO mmo LeCOMELMUIIT oe en a oe ee ae ie ed) Jeet ee 
IIS EEC ET © flee meee ee eee ee Fe ee tM a, SPO esos, eed eres tet eee 
(CORRS Gs sates Bia tite cat a a ae ne ee ER ire EEA 
Ven Ong SET DC Can meee ee ae ae ye EE Ee eee ee Lee 
WMOGAtIOIla le ChOlCG =n ee ee 
Petite an OC COULD) eV tsk (1 eae ere cee trea ne Poe Nee enero See neces Svoscaztevecsonsree 
Information as to previous intelligence tests taken = 
UIT CORE TENSES tS YOON | ora BR mee erry rr 
Relies GHOG MS je CusmldleCe 111Os tamer ns te cee eee nese es ce cake cevecne eezanceseret reece sers 
TET Gel evaxayl Stel oy (GCOS: EMEC MSS er ee ERE RE SETS 
INGUETOIBVERS ub SEANTN URES airy LAWS) SVN SKO) ee perro he aa 
NSH: TAR CLO LeNL ST aa Ler SRT 


The tests were given by principals or by teachers designated by them 
and the information blanks filled out by the seniors themselves. All 
scoring of test papers and tabulation of results was done in the offices 
of the Bureau of Educational Research. 

The second step in collecting high-school data. A year later, in the 
fall of 1924, the 368 high schools were asked to furnish the complete 
high-school scholastic records of all pupils for whom the other in- 
formation had been secured, and also, if possible, to state what, if 


1This was the average mark up to date or for the first three years. It was secured 
from only a minority of the schools and for about 2700 seniors. 
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any, institution of higher learning each individual was attending. A 
few of the seniors of the year before had not been graduated, and in 
a few cases the desired records were not forthcoming, but the loss 
from these sources was comparatively small, so that the complete scho- 
lastic high-school records of about 11,500 graduates were secured. 
Since these marks came from several hundred schools which employed 
a total of over one hundred different marking systems, if all minor 
variations be counted, it was necessary to transmute them to a uni- 
form basis. For this purpose a percentile system with passing at 70 
and no conditions was chosen. The marks given according to all other 
plans were changed to this system by conventional and careful statis- 
tical procedure. 


The collection of college freshman data. Some three hundred in- 
stitutions of higher learning had been named by the seniors in answer 
to the question as to where they expected to continue their educa- 
tion. Early in the academic year of 1925-26 letters were addressed to 
all these institutions asking for the complete 1924-25 scholastic records 
of all freshmen coming from any of the high schools included in this 
study. About 7,700 of the seniors had stated that they intended to 
continue their education, in addition to many who were undecided, and 
the majority of them had named the institutions they expected to at- 
tend. Despite this fact the freshman records of not quite two thousand 
students were all that were secured. This loss is due to at least four 
causes. In the first place, a number of the colleges addressed either 
were unwilling to cooperate in the study, or, after expressing their 
willingness to do so, failed to send the desired records. A second 
reason was that a number of the institutions which did cooperate failed 
to furnish the data for all of their students for whom they were 
desired. Third, undoubtedly many of the high-school graduates who 
planned to attend college found it necessary, for financial or other 
reasons, to postpone entrance for a year or more after high-school 
graduation. The last, and probably the most important, reason was the 
fact that in filling out the information blanks the high-school seniors 
expressed their highest hopes and ambitions or gave answers which 
they thought would sound best and that, therefore, many of them who 
had very slight expectations of ever actually attending college, signified 
that they intended to do so. 

Of the approximately two thousand students whose records were 
secured from various colleges, almost one hundred did not remain in 
college long enough to have any marks recorded. The number for 
whom marks for at least one quarter, term, or semester were secured 


, 
. 


h 
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was 1,892, and for 1,677 of these a full year’s marks were obtained. 
As these marks were given by more than one hundred institutions it 
was necessary to transmute them to a common basis in the same 
manner as had been done for the high-school marks, and so all were 
adjusted to the same basis of a percentile marking system with 70 
as passing and no conditions. 


The collection of complete college data. If the individuals whose 

college freshman records were secured had continued in residence and 

earned the normal amount of credit, they would have been graduated 

in the spring of 1928. Therefore early in the autumn of 1928 the 

writer attempted to secure the complete college scholastic records of 

all those for whom college freshman data had been collected. The 

response from the institutions addressed was so very generous that 

practically none of those concerned were not followed up except a 

few who appear to have transferred from one institution to another 

and could not be located. Four hundred and eighty-six of those con- 

cerned were graduated at the end of the four-year or normal period. 

In addition to these about one hundred others were still in attendance 

at the end of four years. The remainder, about thirteen hundred, had, 

in so far as was known, dropped out of college at some time before the 

completion of the four-year period and had not returned. After the 

data for the whole college course had been collected, they were sub- 
jected to the same procedures as those for the freshman year. 


The reliability? of the data secured in this investigation. There is 
no doubt that in both intelligence test scores and high-school and col- 
lege marks large variable errors are present. No group intelligence test 
so far devised yields highly accurate individual scores, and the Otis 
Self-Administering Test, which requires only half an hour to give, 
is probably less reliable than one, such as the Thorndike Intelligence 
Examination, which consumes two or three hours. Moreover, the 
tests were not administered by a corps of trained and selected exam- 
iners, but by several hundred different principals and teachers, many 
of whom had probably never before given a standardized test. This 
fact undoubtedly served to increase the errors in the scores. It should 
not be overlooked, however, that the test used reduces the directions 
to be given by examiners to a minimum and that, therefore, the errors 
due to lack of training of the persons giving the tests are less than 
would otherwise be the case. The writer does not believe, however, 


2As used in this bulletin, the term “reliability” is practically equivalent to “accuracy.” 
It is not limited to its sometime narrow technical meaning referring to the agreement be- 
tween two sets of scores on the same measuring instrument, although it includes this. 
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that this factor of added reliability is sufficient to balance the two of 
brevity and administration by poorly qualified examiners. 

Another fact which probably affected the significance of the test 
scores was that about half of the seniors tested had never taken an 
intelligence test before and it is likely that many of their scores, when 
compared with those of the seniors who had taken such tests previ- 
ously, do not fairly represent their mental ability. Furthermore, be- 
cause of the conditions under which the tests were given, there was 
generally no particular incentive, apart from the desire to excel, for the 
seniors to do their best. Hence, it is likely that a considerable number 
of them did not put forth maximum effort while taking the test. These 
and all other causes which produce variable or accidental errors in 
the test scores result in lowering the correlations and other predictive 
indices based thereon and justify the conclusion that the real rela- 
tionships are somewhat closer than those actually computed. 

Too much evidence and discussion concerning the subjectivity and 
unreliability of school marks has appeared within the last few years 
for the subject to need extended comment in this connection. Undoubt- 
edly the errors present in the marks were increased somewhat by the 
fact that marks from several hundred high schools and more than a 
hundred colleges with different systems and standards were trans- 
muted to a common basis and thrown into a single group. In spite of 
the fact that the transmutation was made with great care and followed 
sound statistical procedure, it was not possible, in all cases, to be sure 
that the transmuted marks were really equivalent to the original ones. 
The effect of increasing such variable errors was to lower the co- 
efficients of correlation and other predictive measures secured. 


The computation of zero-order coefficients of correlation. As was 
previously stated, the chief method employed to determine the rela- 
tionship between college marks and other data available was that of 
correlation. In the case of each of more than fifty college subjects or 
closely related groups of subjects* simple or zero-order coefficients of 
correlation were computed between the marks therein, and those of 
the other data that seemed most likely to have predictive value. In 
all cases the college marks were correlated with intelligence test scores 
and general high-school averages, and in the case of each subject with 
the marks in those high-school subjects or groups of subjects that 
seemed most likely to show the closest relationships. The correlations 

8In a number of cases it is doubtful just what really constitutes a “subject” as the 
rere only a few freuen who caster sath of fie ase gee oe 
followed was to group them together as a single subject. Agriculture, therefore, includes 


various courses in agronomy, animal husbandry, and so forth; art includes freehand draw- 
ing, painting and sculpture, and so on with others. 
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previously obtained in the study of freshman college marks were used 
as a guide in determining those which it seemed most profitable to com- 
pute in this study. All college subjects or groups of subjects which 
were carried by enough students that the resulting data would be fairly 
reliable were dealt with in this manner. 


The computation of coefficients of multiple correlation and re- 
gression. The calculation of zero order or simple coefficients of cor- 
relation was followed by that of multiple coefficients. In view of the 
considerable amount of labor involved in computing the latter they 
were not found for all college subjects, but for only about 40 percent 
of them. These were in general the subjects carried by the largest 
numbers of students and those for which the simple correlations were 
highest. In connection with this the admission should be made that 
since many possible correlations were not computed, it is probable that 
some were omitted which should have been found. Since the amount 
of money available for clerical help, although fairly large, was not un- 
limited, it was necessary that the line be drawn somewhere, and it is 
very likely that the writer’s judgment in selecting the most promising 
possibilities was not infallible. In the case of several of the college 
subjects two or three groupings were made according to the high-school 
subjects carried and a different set of multiple correlations computed 
for each grouping. For example, in addition to calculating the correla- 
tions and regressions for all students who carried French as a college 
subject, they were also found separately for the portion of this group 
that had carried high-school French and also for that which had taken 
high-school Latin. Also in the cases of a few college subjects separate 
correlations were computed according to the number of hours of work 
therein carried by different groups of students. The most common 
basis was to put into one group students who had carried less than 
ten hours and into another those who had carried ten or more. Some- 
times the latter group was divided into two, one of which contained 
those who had carried from ten to nineteen hours inclusive, and the 
other, those who had carried twenty or more. 

The general procedure in computing the multiple coefficients was to 
start with the highest one of zero order and combine the others of the 
same order with it until the addition of another criterion no longer 
increased the obtained coefficient by as much as .01. Because of the 
fact referred to above, that many of the simple coefficients of corre- 
lation were really multiple in nature though not in derivation, it could 
not be expected that on the whole there would be as great an increase 
in the multiple coefficients over those of zero order as would otherwise 


have been the case. 
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The question may be raised as to why certain combinations, which 
will appear later in the chapter containing the multiple correlation 
results, were made, in view of the fact that one of the simple correla- 
tions already used was that of the freshman mark with the general 
high-school average or the average in a group of similar subjects, and 
another the correlation with one of the subjects which entered into 
this group. The reason for so doing is, however, clear to any one 
familiar with multiple correlation. In computing the high-school gen- 
eral average or the average in any group of similar subjects the marks 
entering into the given average were all allowed the same weight in de- 
termining it. By means of multiple regression equations, however, 
one is able to determine the optimum weight which should be given 
to each factor, that is, the weight to give it so that the highest correla- 
tion or predictive power will be obtained. 

The direct method of securing the same result would be to use 
no averages of marks in different high-school subjects, but to consider 
each as a separate variable or criterion in the multiple correlation work. 
The reason this was not done was that it would have increased very 
greatly the amount of calculation necessary without yielding more 
helpful results than the method used. It would, of course, have shown 
exactly just which of the subjects entering into the high-school average 
were useful for making the best prediction in each case and which were 
not, but there seems little advantage in knowing this, provided one 
knows how to make as good an estimate without this knowledge and 
with even less labor. Not only was much work saved in computation, 
but also in the use of results, since the multiple coefficients secured 
involve, on the whole, fewer variables or criteria than would be the 
case 1f averages of high-school subjects had not been taken and there- 
fore require less computation in employing them for predictive pur- 
poses. The objection can be raised that there are included in the 
general high-school average marks made in subjects which show much 
lower correlations with the freshman subject being considered than 
do those of certain other high-school subjects and that the inclusion 
of these marks may have lowered the correlation between the fresh- 
man subject marks and the high-school average. This contention is 
true, but the writer believes that for all practical purposes any such 
results have been taken care of by including in the multiple correlations 
the subjects which appeared at all likely to make any contribution 
to them. Thus, for example, if freshman French mark was best pre- 
dicted by a combination of high-school marks in English, French and 
Latin and point score, rather than by including the general high-school 
averoee, the method of computation used eliminated the latter. In any 
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event, in view of the practical limitations of time and money, it seemed 
advisable, if not absolutely necessary, to follow the method described 
above. 


Minor methods of prediction employed. In addition to the method 
of correlation a few minor methods were employed and will be pre- 
sented later. By these means will be shown the relationship between 
college success and size of high school attended, occupational inten- 
tions, fathers’ occupations, and so forth: Since most of these involve 
one factor that cannot well be expressed in numerical classes and, 
therefore, does not lend itself to the computation of coefficients of cor- 
relation,* the method of summarizing such data will ordinarily be 
by means or medians. 


The measures of accuracy of prediction obtained in this study. 
Finally, as a measure of the accuracy or reliability of predictions based 
upon coefficients of correlation, the corresponding coefficients of aliena- 
tion and probable errors of estimate were determined. The first of 
these,° the coefficient of alienation, is an expression that shows the 
relationship between the prediction based upon a given coefficient of 
correlation and a pure guess. For example, the coefficient of alienation 
which corresponds to a correlation coefficient of .65 is approximately 
./6. This means that if two variables or series of scores correlate .65 
with each other, the estimates of particular scores in one series based 
upon corresponding known scores in the other will on the average 
be in error by about .76 as much as if the errors resulted from pure 
guesses, or, subtracting .76 from 1.00, that the errors will be .24 
smaller than those in pure guesses. 

The probable error of estimate describes the same situation by 
stating the limits within which half of the errors will fall. For ex- 
ample, if the probable error of estimate is found to be 4 points on the 
percentile scale, it means that half of the estimated scores will not vary 
from the true scores by more than 4 per cent, and, of course, that the 
other half will differ by more than this amount. These two indices, 
the coefficient of alienation and the probable error of measurement, 
give a more concrete and meaningful description of the accuracy of 
prediction than does the coefficient of correlation. 


4As may be seen by reference to textbooks in statistics, there are methods for com- 
on of non-numerical data, but since they are not familiar to most readers 


puting correlati ; 1 2 é 5 xe 
of this bulletin, and since it does not appear to the writer that they provide more significant 
interpretations of the data than does the use of medians, they will not be employed herein. 


5For a more complete discussion of the coefficient of alienation and the probable error 


imate, see Chater VI. Also: i 
S © Odels, C. W. “The Interpretation of the Probable Error and the Coefficient of Cor- 
University of Illinois Bulletin, Vol. 23, No. 52, Bureau of Educational Research 
f University of Illinois, 1926, p. 28-32 and 41-45, and 
Century Company, 1925, p. 173-74, 


relation,” 
Bulletin No. 32. Urbana: | 7 0 
Odell, C. W. Educational Statistics. New York: 


230-41, or some other text on the same subject. 


CHAPTER III 


THE PREDICTION OF COLLEGE MARKS FROM THE 
_ DATA AVAILABLE AT COLLEGE ENTRANCE 


The simple correlations between college marks and the other data. 
In Table I the simple or zere-order coefficients oi correlation between 
college marks and the data available in this study at the time of col- 
lege entrance are given. The college marks used were those in fifty- 
five subjects or closely related groups of subjects for the whole num- 
ber of hours thereof carried by the individuals included in the study. 
The fifty-five include all subjects or groups of subjects which were 
pursued by enough students that they seemed worth dealing with, 
especially from the standpoint of the reliability of the results. In the 
column headed “Point Score” are the coefficients of correlations be- 
tween the college marks just mentioned and-the point scores on the 
intelligence test given when the students were high-school seniors. In 
the next column are the correlations between college marks and high- 
school averages. To the right of this column are the correlations be- 
tween the college marks and those in the high-school subjects or groups 
of subjects that seemed most likely to have the closest relationships 
with them. These were chosen largely on the basis of the results ob- 
tained for freshman college marks and published in the report of that 
study. Thus the first row of the table, for example, shows that marks 
in college accountancy correlated .29 with point scores, 45 with high- 
school averages, .29 with average marks in all high-school commercial 
work, and .47 with those in all high-school mathematics. In some cases 
there were no high-school subjects or groups of subjects that appeared 
sufficiently similar to the college subjects dealt with that it seemed 
worth while to correlate the marks therein. As is stated by the first 
note at the bottom of the table, those coefficients enclosed in paren- 
theses are not reliable since they are not equal to more than twice their 
standard errors or three times their probable errors. The chances are 
greater than twenty-one or twenty-two to one that all of the coefficients 
not enclosed in parentheses are significant, and for most of them the 
chances are very much greater than this. 

li the figures given in this table for complete college marks are 
compared with the corresponding ones for college freshman marks as 
previously reported,* it will be seen that on the whole there is very 


V. “Predicting the Scholastic Success of College Freshmen,” University oF 
, Vol is reau of Educational Research Bulletin No. 37. Ur 
mois, 1927, Tahbic Ti, opposite p. 28 
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TABLE I.—CoeErriciEnTs oF CORRELATION oF CoLLEGE Marks Wrru Point Scores 
AND Higu-Scuoot Marks 


‘ z High-School High-School Subjects and 
College Subject Point Score Average Groups of Gabjecta 

Accountancy.......... 29 45 Comm. .29, M 

QUECANL OY, crafstarsfeleteyereialels siei5)> ; . . .29, Math. .47 
PAGTICUILULG]\.scre fueMermca cn .38 51 Agr. .56, Sci. .46 
PNER EOI Y ete craic croc es (.10)8 45 Sei. .30 

OAR EOS BIDE e SOE GE OOS .19 .26 Art .41 
PASHPODOMY aah sic. cr ars See re ical 42 41 Math. .39 
Athletic Coaching............. 34 .43 
Bacteriology... ..0... 0-22.00 (Gulp) (.19) Biol. .25 
ii Assne do scuneudoosbodasae 26 41 Eng. .40 
Biclogy AR fee sromerseineais sale oe 24 48 Biol. .34, Gen. Sci. .37, Sci. .42 

OUD Yee ejaic crete sia nelaics sik 42 43 Biol. .48, Gen. Sci. .38, Sci. .42 
PB UsINOSS LA Weiss sc. 0 ssieieiais esece's 25 .23 
ae Org. and Oper............ (.12) BD) 

INEM Zr oo oogodopddseDeee .33 Al Chem. .48, Gen. Sci. .25, Phys. i 
Sei at cant. (18) “31 Hiss a et ae oo ek eS 
Drawing etc mceiin donceter 15 .20 
Heonomics.........--......+- 26 39 Civ. .32, Econ, .32, Hist. .40 
Bduca tone es cricie wielsae een « 28 .39 
LNG, cect opeoocice sosnagess .33 45 Eng. .38 
Elec. Engineering............. (.00) (.05) Math. .10 
renGh Soriret seiner eewiceine vee “Eyl 52 For. Lang. .48, French .50, Lat. .47, Span. .43 
Gen. Engin. Drawing.......... 30 BR Math. .29, Free Dr. .38 
Geograph yicessses ose. ce eee 38 Al Geog. .24, Sci. .35 
(Germans. as ccs sefee wets nis sieie<ie 28 45 For. Lang. .41 
Geologyek aed views wos te 17 -40 Sei. 33 
(Greckweern etka melesteysie cis atate's (.23) 48 For. Lang. .49, Lat. .44 
History....... re eee 5 Al Hist. .41 
Home Economics............. .20 .28 | 
JEhPutG)-S sadooc csecosencoene 14 .36 Sei. .84 
ouenaligm sep eis.cc ss.2 ss escts <tr 64 56 | Eng. 56 
PANE UAGO eet ataere ole re oleiel sis 25 36 | Eng. 31, For. Lang. .43 
BAGH) ateieete is «2 eves nnie 25 47 | For. Lang. .34, French .41, Lat. .39 
Library. ... 35 48 | Eng. .45 
Mathematics . ve} AT Alg. .88, Geom. .42, Math. .44 
Military..... (.04) 21 
iG hin TO GOR Re oe te eee ener 02 .30 
Nature Studyjeu sass e251 (.11) 29 
Oplentatlon. essere nace acl seki 42 ot! Hist. .61 
Philosophy:ererise ceric esi ieictsiere 13 84 Eng. .34, Hist. .33 
PHONICS eiae ta cisetatevcincts stsiclens <7 387 42 Eng. .33, For. Lang. .48 
Physical Education........... .18 Sly 
IPH BIOS Serres aiclel<teialsyelcie=isserele aie 33 39 Geom. .35, Math. .37, Phys. .30, Sci. .36 
Ph ysiologyanriares tiles eclrstele 38 39 Gen. Sci. .20, Sci. .38 
Political Science.............. 20 29 Civ. .34, Hist. .27 
psy cholog yasestasnineue come c= 43 33 Sei. 37 
Public Speaking......-....... 19 BY | Eng. .35 
Reading ete is ic tecass: 30 35 | Eng. .35 
Religious Education........... (.10) .36 
IRBGLORIG rete ass Mane ele sininisl .B4 46 Eng. .46, Lat. .39 
Scandinavians acs)! s\ere1s-0'0° (.28) (.31) Eng. 41 
MO CIOLOR Vine etnaTerctetrs vererinvelol* 127 40 Civ. .35, Hist. 385 
SAMS MMe er eee eyelet rataren sc ete 20 52 For. Lang. .49, French .52, Lat. .46, Span. .51 
Stenography.........- hoe (3d) .39 Eng. .34 
Theor. and App. Mech......... (.01) 29 Geom. .34, Math. .37 
Transportation. ........-....- (.16) soe ; e 
LOOLOR Yee acon aeeteleirsle « 38 48 | Biol. .40, Sci. .40 
College Average>.............. ol 54 | 


The coefficients enclosed in parentheses are not reliable. 


each, they are not equal to more than twice their standard errors, 


parentheses are all larger than this ratio. 
bThe coefficients following ‘College 


averages of the columns above them. 


Because of the small number of cases contributing to 
or three times their probable errors. Those without 


Average” are those of the average freshman mark in all subjects, not the 
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little difference. The correlation between complete college averages 
and point scores given here, .31, is several points lower than that 
between freshman averages and point scores, which was found to be 
.38. The correlation between complete college averages and high-school 
averages, .54, is almost exactly the same as that between freshman 
averages and high-school averages, which was found to be .55. If the 
coefficients of correlation of the separate college subjects with point 
scores, high-school averages and high-school subjects or groups of sub- 
jects are averaged, they are found to be, respectively, .26, .38, and .39. 
For freshman marks the corresponding figures are: .27, .38, and .38. 
Thus in no case is the difference greater than .01, which shows that 
the relationship between marks received during the whole period of 
attendance at college and the data available at college entrance is almost 
exactly the same as that between freshman college marks and the same 
data. 

In the cases of a number of the different college subjects included 
in Table I, some or all of the coefficients of correlation are decidedly 
higher or lower than the corresponding ones for freshman marks, but 
in general the differences are not great. There is a strong tendency 
for the same subjects to have high correlations, likewise for the same 
ones to have low correlations. For example, agriculture, French, and 
Spanish are among those relatively high in both tables, and art, mili- 
tary work, and physical education among the low ones. 

An inspection of the table shows that there are very few coeffi- 
cients of .50 or above. The only college subject that correlates more 
than this with point score is journalism, for which the coefficient is 
.64, The correlations with high-school average are somewhat higher, 
there being five subjects which do so to the extent of .50 or above. One 
of these, orientation, gives a coefficient of .71, but none of the others, 
agriculture, French, journalism, and Spanish, rise very much above 
.90. General college average, it will be noted, correlates .54 with high- 
school average. Among the correlations between college subjects and 
high-school subjects, or groups of subjects, there are about half a 
dozen that reach .50. Agriculture correlates .56 with high-school agri- 
culture; French, .50 with high-school French; journalism, .56 with 
high-school English ; orientation, .61 with high-school history ; Spanish, 
.92 with high-school French and .51 with high-school Spanish. When 
it is recalled that a coefficient of correlation of .50 is equivalent to a 
coefficient of alienation or element of uncertainty in prediction of 
almost .87, it will be seen that the accuracy with which college marks 
can be predicted by simple correlation with point scores, high-school 
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averages or marks in high-school subjects, is decidedly low. In the 
following section the accuracy of prediction when multiple correlation 


. is employed will be discussed. 


The multiple coefficients of correlation between college marks and 


_the other data. Twenty-three of the fifty-five college subjects listed 


in Table I were selected for the application of multiple correlation. In 
general these were the subjects in which there were the largest en- 
rollments or for which the simple correlations were the highest. In 
several of the subjects two groupings were made according to the 
number of hours of college work carried by students therein and sepa- 
rate results computed for those who had carried the smaller and the 
larger numbers of hours. Furthermore, in four subjects separate tabu- 
lations were made according to the high-school subjects carried by the 
individuals concerned. 

Table II presents, in addition to certain other related data, the 
highest multiple coefficients for each of the twenty-three subjects. The 
first set of four columns therein is for the highest simple coefficient 
of correlation obtained for each of the subjects mentioned, the second 
group of four is for the highest multiple coefficient, and the last group 
of three is for the increase of the multiple over the simple coefficient. 
Within each of the two groups of four the first column, headed “r” 
and “R,” contains the actual coefficients of correlation; the second, 
headed “‘k,” the corresponding coefficients of alienation; the third, 
headed “P.E..s:,”’ the corresponding probable errors of estimate; and 
the fourth, the one or more criteria used in the correlations. The last 
three columns contain, in order, the increases in the highest multiple 
over the highest simple coefficients of correlation and the accompanying 
decreases in the coefficients of alienation and the probable errors of 
estimate. For example, taking the first line of the table, the highest 
simple coefficient of correlation of the accountancy mark with any 
single criterion was .57, the corresponding coefficient of alienation was 
.82, the probable error of estimate, 5, and the criterion, high-school 
mathematics mark. The highest multiple correlation obtained for alge- 
bra was .58, with a coefficient of alienation of .82 and a probable error 
of estimate of 5. It was based on two criteria, high-school mathematics 
mark and point score. The increase in the coefficient of correlation 
was .01, whereas there was no change in the coefficient of alienation 


or the probable error of estimate. 


ase in the coefficient of correlation and de- 


3 d that an incre : A 
It should be remembere probable error of estimate indicate closer re- 


creases in the coefficient of alienation and the 
lationship or greater accuracy of prediction. 
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It will be seen from the table that only in the cases of agriculture, 
chemistry (20 +-)II, and orientation were multiple coefficients of .70 
or above obtained from the best possible combination of criteria. In 
just about one-fourth of all cases were the highest multiple coefficients 
as large as .60, whereas in more than one-third they were below .50. 
Most of the corresponding coefficients of alienation were between .80 
and .90, although a few were smaller. The probable errors of estimate 
were in most cases either four or five points on the percentile scale. 
The smallest, those for agriculture and economics (20 +), were two. 
If these results are compared with those given in the report of the 
study of freshman college marks,* it will be seen that there is a slight 
but not very strong tendency for those found in this study to be higher. 
None of the coefficients of correlations for freshman marks were above 
.63, and, correspondingly, none of the coefficients of alienation were 
below .78. On the other hand, the proportion of multiple coefficients 
below .50 is almost the same in the one case as in the other. The con- 
clusion to be drawn appears to be that complete college marks can 
probably be predicted with slightly greater accuracy than can freshman 
marks in a number of subjects, whereas in others there is practically 
no difference in the certainty of prediction. 

A comparison of the multiple correlation coefficients with the simple 
ones given in Table II shows that the increases of the former over the 
latter were comparatively small. In only two cases, those of botany 
and zoology, were they greater than .10, and in well over half of them 
they were less than .05. Correspondingly the decreases in the coefficient 
of alienation were in only six cases as great as .05 and in more than 
half not greater than .02. In only four cases was the decrease in the 
probable error of estimate great enough to appear when whole num- 
bers were used. The situation may be summed up by saying that the 
increased reliability of prediction obtained by using multiple instead of 
simple correlation was in very few cases sufficient to justify the ad- 
ditional labor and expense involved. 

An inspection of the columns of Table II, in which the criteria are 
given, shows that, with few exceptions, the most helpful one was either 
the high-school average or the mark in the same or a similar high- 
school subject or group of subjects. For only three of the simple cor- 
relations included in the table were point scores the best criteria of 
prediction. On the other hand, most of the groups of criteria involved 
in t'e multiple correlations included point scores among them. Ap- 
parently, therefore, the intelligence test scores, although less valuable 


4Odell, op. cit., p. 36-37. 
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in prediction than high-school marks, do contribute a somewhat differ- 
ent element than do marks. 


Relationship between college marks and vocational choices. It has 
sometimes been suggested that the vocational choices of college stu- 
dents are indicative of the quality of their college work. A tabulation 
was, therefore, made to show the relationship of this sort existing in 
the case of the students included in the study. It seems scarcely worth 
while to give the complete table showing this relationship, since the 
significant points thereof may be stated in comparatively few words. 
The results tend to be similar to those found by other investigators 
to the effect that students who have chosen vocations that require 
longer periods of training or, in other words, those that are generally 
termed professions, tend to make somewhat higher marks than do 
those who have made other choices. The differences found in this 
study, however, are comparatively small, and in the case of one or 
two vocational groups exceptions exist. Those who have made the 
following vocational choices averaged approximately 2 per cent or 
more above the total group: law, journalism and other publicity work, 
and philanthropic work. The only professional group below the aver- 
age was in medicine, and this was so slightly below that the difference 
was not significant. The average for those who had not made voca- 
tional choices was enough below that of those who had made such 
choices that the difference was statistically significant. On the whole, 
however, there was such great overlapping shown that it can scarcely 
be said that in the case of any individual student the knowledge of 
what vocation he has chosen or, indeed, of whether he has made any 
such choice or not, affords significant help in predicting his average 
college mark. 

Relationship between college marks and fathers’ occupations. The 
results from a comparison of college marks with the occupations of 
the students’ fathers’ indicated that the latter had practically no con- 
nection with the former. The children of lawyers and philanthropic 
workers made marks far enough above the general average that the 
differences were significant, and those of transportation employees far 
enough below to be barely significant, but in the case of no other group 
was the difference appreciable, and even in these they were barely 
statistically significant. It seems, therefore, even more true of fathers’ 
occupations than of vocational choices that they are of so little value 


the students filled out the question 


, ivi at the time : 
STf the students’ fathers were living fi Gore dead, ihe. lasieccunaiod 


blanks, their occupations at that time were reported. If 
in which each was engaged was reported. 
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for predicting college marks that they are probably not worth using 
in individual cases. 


Summary. In this chapter the simple and multiple correlations 
between college marks and high-school marks and point scores have 
been presented and also a brief statement given of the relationship 
between college marks and vocational choices and fathers’ occupations. 
In general, the use of the best single criterion for predicting college 
marks involved a coefficient of alienation or uncertainty of prediction 
of from .80 to .90, and in no case was it less than .70. The situation 
is only slightly better when the method of multiple correlation is em- 
ployed, since the smallest resulting coefficient of alienation was .64 
and since only about one-fourth of the coefficients of alienation were 
less than .80. Occupational choices offered a little evidence as to col- 
lege marks, and fathers’ occupations, practically none at all. 


CHAPTER IV 


THE PREDICTION OF MARKS DURING THE LAST THREE 
YEARS OF COLLEGE FROM DATA AVAILABLE 
AT THE END OF THE FRESHMAN YEAR 


The simple correlations between marks during the last three years 
and the other data. Only those simple coefficients of correlation be- 
tween the marks received by college students during their last three 
years and the other data were computed that were needed for the 
multiple correlations to be obtained. Just a dozen subjects were dealt 
with, these being the ones carried by the larger numbers of students 
during their last three years, and likewise during the freshman year. 
In other words, certain subjects commonly not begun until after the 
freshman year were not included in these computations. Since the 
highest simple coefficients for the various subjects are given in Table 
III, no separate table containing all of the simple coefficients will be 
included. 

An inspection of those given in the first portion of this table shows 
that the simple correlations between marks during the last three years 
and the best single criteria tended to be slightly higher than the cor- 
responding correlations for complete college marks as shown in the 
first portion of Table II. Indeed, none of the coefficients for the last 
three years were quite as large as two of those given in Table II. In 
all but three cases the coefficients in Table III were larger than those 
for the same subjects in Table II. Probably the most noteworthy in- 
crease was in the coefficient for the general college average. For the 
last three years’ work the average correlated .69 with the best criterion, 
whereas for the complete college record the corresponding figure was 
only .54, a difference of .15. All of the subjects, biology, French, 
Latin, and Spanish, had simple coefficients with their best criteria of 
between .65 and .70, and corresponding coefficients of alienation of .75 
or slightly less. The probable errors of estimate were in most cases 
four percentile points. 

It will be seen that in every case except one, the criterion which 
gave the highest correlation with the marks made during the last three 
years was either the general freshman average or the freshman mark 
in the same subject. The one exception to this was zoology, which 
showed a higher correlation with point score than with any other cri- 
terion. 

The multiple coefficients of correlation between marks during the 
last three years and the other data. The multiple coefficients of cor- 
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relation presented in the second portion of Table III showed about the 
same increases over the simple correlations for the same data as were 
found in the case of complete college marks. Although none of the 
simple correlations in Table III were higher than the largest of those 
in Table II, yet one of the multiple correlations for the last three 
years, that for Latin, was greater than any corresponding one in Table 
II. Several others for the last three years, in addition to Latin, were 
also above .70, and, correspondingly, sevéral of the coefficients of alien- 
ation were below .70. In only two or three cases, however, were the 
increases in accuracy great enough to result in an integral change in the 
probable errors of estimate. 

Just as in the case of the simple correlations, so in the multiple 
ones the point score appears very rarely. Instead, high-school marks, 
in most cases those in particular subjects but in a few the general 
high-school average, ranked next to freshman average and fresh- 
man subject marks in worth as criteria. Indeed, except for the one 
subject of zoology, it appears that very little accuracy of prediction 
would have been lost if no intelligence test scores at all had been 
available. 


Relationship between marks during the last three years of college 
and vocational choices and fathers’ occupations. The relationship 
found to exist between the marks made during the last three years 
of college and vocational choices of the students and their fathers’ 
occupations will not be presented in tabular form. It was, in general, 
distinctly lower than that found. in the case of the complete college 
records and the criteria mentioned. There was a tendency for the same 
trends to appear, but none of them were marked enough to indicate 
conclusively that either vocational choices or fathers’ occupations were 
valuable enough criteria of prediction to warrant their use. 


Summary. The simple and multiple correlations between marks 
made during the last three years of college and those received during 
the high-school course and the freshman year of college were found to 
be somewhat higher than those between the complete college records 
and high-school marks alone. On the other hand, the coefficients be- 
tween marks during the last three years and point scores were some- 
what lower than were those in the case of complete college records 
and point scores. Likewise, the relationship of three-year marks with 
vocational choices and fathers’ occupations was less than in the other 
case. Indeed, the statement is justified that only in a few cases did 
intelligence test scores, and in almost none at all, did vocational choices 
or fathers’ occupations offer evidence of sufficiently predictive value 


to justify their use. 


CHAPTER V 


THE RELATIONSHIP BETWEEN PERSISTENCE IN 
COLLEGE AND VARIOUS OTHER DATA 


The relationship between persistence in college and intelligence 
test scores. Table IV shows the relationship between persistence, that 
is, length of attendance in higher institutions, and intelligence test 
scores. In this and the succeeding tables that deal with length of at- 
tendance, students have been grouped according to the number of 
semesters they remained in college, and also according to whether they 
were dropped! or left voluntarily with or without graduation at the 
end of each. For example, all students who left at the end of one 
semester were divided into those who were dropped and those who 
left voluntarily. Similarly, those who left at the end of four semesters 
were divided into those who were dropped, those who quit voluntarily 
without completing a given course, and those who were graduated 
from a two-year course. Undoubtedly many of those who were not 
dropped but who did not continue were doing work of such poor 
quality as to be on probation or otherwise in danger of being dropped 
and, hence, did not return. No attempt, however, was made to sepa- 
rate these from the others who quit of their own will. 

Similar to the results in other studies, the data from this investi- 
gation show a considerable relationship between the two items dealt 
with. This may be seen best by looking at the row of medians and 
that of per cents dropped at the bottom of the table. Although there 
are some irregularities, it will be seen that the median persistence or 
time attended had a tendency to increase along with the intelligence 
test score. Thus, for the groups that had scores of from twenty to 
twenty-four and twenty-five to twenty-nine, the lowest two that had 
more than ten cases, the median time attended was three semesters. 
For the next four higher groups it was four semesters, for the next 
four it was six, and finally, for the highest it was four-year gradua- 
tion. Although the per cents of students dropped were somewhat more 
irregular, they still show the same general trend. Students who made 
test scores of sixty or above averaged more than one and one-half 
times as long in attendance as those whose scores were below fiity 
and had only about one-half as great a chance of being dropped. 


1The students reported as dropped include both those dropped because of poor scholar- 


ship and those dropped for disciplinary or other reasons. The former, h i 
much the larger portion of the group. Pn ae a a 
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The relationship between persistence in college and high-school 
averages. Table V is similar to Table IV, except that it deals with 
high-school averages rather than with point scores. From an inspection 
of Table IV and a comparison thereof with the last table, the relation- 
ship between high-school averages and persistence in college appears 
to be about the same as that between intelligence test scores and per- 
sistence. The medians given at the bottom of Table V show a fairly 
regular increase, except for a slight drop at the upper end. The per 
cents dropped decrease with somewhat greater regularity than the cor- 
responding per cents in the case of intelligence test scores, although 
here also there is an irregularity at the upper end. It appears that 
students with high-school averages of ninety or better averaged about 
twice as long in college attendance as did those with similar averages 
below eighty. Furthermore, the chance of a student with a high- 
school average above ninety being dropped from college was only 
about one-tenth as great as that of a student whose high-school aver- 
age was below eighty. 


Relationship between persistence in college and vocational choice. 
Table VI shows the relationship found between the length of time 
students remained in attendance in college and their vocational choices. 
The degree of relationship shown is not so great as some similar studies 
have reported, but it appears to be slightly greater than that between 
college marks and vocational choices. From the table it will be seen 
that, although the average length of attendance of those included was 
only four semesters, there were several vocational groups including 
enough students to be fairly reliable that had averages of six semesters 
of attendance or more. With the exception of the farming group, all 
of these are what are commonly termed “professional.’’ On the other 
hand, none of the groups, not even the one with no vocational choice, 
fell below the general average of four semesters. 


Relationship between persistence in college and father’s occupa- 
tion. No table will be given to present the data showing the degree 
of relationship between persistence in college and fathers’ occupations. 
The data suggested more or less the same relationships as were found 
between persistence and vocational choices, but they were on the whole 
less pronounced. The children of those engaged in philanthropic work 
were found to remain in college an average of seven semesters, and 
those whose fathers were engaged in clerical work, law, medicine, and 
publicity work, six semesters. None of the other groups showed any 
significant deviation from the average. 
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Relationship between persistence in college and freshman col- 
lege average. Table VII is similar to Table V, except that it deals 
with freshman college averages rather than high-school averages. As 
most readers would probably expect, it shows that there was a closer 
relationship between freshman averages and length of college attend- 
ance than existed in the case of any of the other data presented in 
this chapter and the latter. The median length of attendance rose very 
sharply from the group having the lowest freshman average to that 
with the next to the highest. The very highest group, which was quite 
small, had a somewhat smaller average length of attendance. Those 
in the lowest freshman average group, the one with marks below sixty- 
five, were practically all dropped the first semester, and almost all of 
those in the next group, from sixty-five to sixty-nine, were dropped 
by the end of the second semester, whereas the median attendance for - 
the group whose averages were ninety to ninety-four was eight se- 
mesters. In the case of per cents dropped the differences were per- 
haps even more striking. Ninety per cent of the lowest group and 70 
per cent of the next to the lowest group were dropped, whereas only 
2 per cent of the group with averages from eighty-five to eighty-nine, 
and none of the group with averages from ninety to ninety-four, were 
dropped. It appears that if a student had a freshman average of 
eighty-five or above, he had only about one chance in eighty of being 
dropped at any time, whereas if his average was below seventy, there 
were about three chances out of four that he would be dropped, if it 
was below seventy-five, almost one chance in two, and if below eighty, 
almost one in three. 

Summary. From the data obtained in this study it appears that 
there is a definite positive relationship between the lengths of time 
students remain in attendance in college and their intelligence test 
scores, their high-school averages, their vocational choices, and their 
freshman averages. The relationship is the closest in the case of fresh- 
man averages. There is also some relationship between persistence and 
father’s occupation, but it is scarcely enough to justify using the latter 
as a basis of prediction. If a student is in the upper quarter or even 
the upper third of both intelligence test scores and high-school aver- 
ages, it is very unlikely that he will be dropped from college, whereas 
if he is in the lower quarter or third of both, his chances of being 
dropped are fairly large, and the chances are at least eight or ten to 
one that he will not remain in college much, if any, longer than two 
years. If both freshman average and intelligence test score are high or 
low, the corresponding chances are much larger, indeed, almost double. 
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CHAPTER VI 
SUMMARY AND CONCLUSIONS 


The problem. The problem of this investigation has been to de- 
termine how accurately the marks and length of attendance of college 
students can be predicted from certain data available when they enter 
college and also how accurately those for the last three years of col- 
lege can be predicted from data available at the end of the freshman 
year: 


The prediction of college marks from data available at entrance. 
The simple correlations between college marks in fifty-five subjects or 
groups of subjects and high-school marks and intelligence test scores 
were found. Very few of the coefficients were above .50, the majority 
of those with high-school marks ranging from .30 to .50, and of those 
with point scores from .20 to .40. The corresponding coefficients of 
alienation, which measure the guessing element, were in no instance 
less than .70, and in most cases between .80 and .90. The multiple co- 
efficients of correlation for a number of the same subjects were in 
general only a comparatively few points larger than the highest simple 
coefficients for the same subjects. About one-fourth of the multiple 
coefficients were .60 or above, but the majority of them fell between 
AO and .60. A very few of the corresponding coefficients of alienation 
were around .70, but most of them fell between .80 and .90. The 
probable errors of estimate were generally four or five percentile 
points. Students’ vocational choices appeared to have a little signifi- 
cance in indicating their college marks, and their fathers’ occupations 
very little, if any. 

The prediction of marks in the upper three years of college. For 
those subjects carried by a fairly large number of students in the last 
three years of college, the simple and multiple correlations with data 
available at the end of the freshman year were found. On the whole, 
these correlations tended to be only slightly higher than those of the 
complete college marks with the data available at entrance. 

The prediction of persistence in college. Persistence or length of 
attendance in college appeared to have a decided relationship with 
students’ intelligence test scores and an even higher degree of rela- 
tionship with their high-school averages. That with vocational choices 
was somewhat stronger than in the case of college marks and voca- 
tional choices, and the same was true with regard to fathers’ occupa- 
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tions. For the last three years of college, persistence had a decidedly 
close relationship with freshman average. 


Conclusion as to the accuracy of predicting college success. 
Although, as has been shown previously, the criteria employed in 
this study did not yield highly accurate predictions of success in col- 
lege as measured by marks received and length of attendance, yet 
they exhibited some relationship therewith. In other words, if, as is 
true in many situations, some selection is to take place, these data 
offer a distinctly better basis than does mere chance for the selection 
of those students who will profit most by college attendance. More- 
over, other studies have shown that by the use of a longer and more 
reliable intelligence test, such, for example, as Thorndike’s Intelli- 
gence Examination for High-School Graduates, of the best types 
of entrance examinations or standardized tests covering work carried 
in high school, of ratings of study habits and other factors that con- 
tribute to success, the reliability of prediction can be increased con- 
siderably above that obtained in this study. By the combination of 
these means, it is probable that under favorable circumstances an 
institution of higher learning can obtain multiple coefficients of cor- 
relation of at least .75 or .80 for many of the subjects carried therein, 
and of above .80 for the general college average. Thus, for the 
latter it is probable that the coefficient of alienation, or guessing ele- 
ment, can be reduced to about one-half, and in some cases even some- 
what below this. 


APPENDIX A 


THE RELATIONSHIP BETWEEN SIZE OF HIGH SCHOOL 
ATTENDED AND COLLEGE SUCCESS 


From time to time there has been considerable discussion as to 
whether students who have had their high-school training in schools 
of different sizes show any consistent and significant differences in 
level of achievement and ability to do college work. In view of this 
fact it seemed worth while as a sort of by-product of this investiga- 
tion to tabulate the available data so that they might, if possible, 
throw some light upon this question. For this purpose Tables VIII 
and IX are presented. The first shows the relationship between 
size of high school attended and college averages, and the second, 
that between size of high school and persistence in college. In con- 
structing each, the high schools attended by the students included were 
classified into five groups on the basis of their size at the time the 
students were seniors in them, that is, in the school year 1923-24. As 
will be seen from both tables, there is some tendency for the students 
coming from the smallest group of high schools, that is, those with 
less than one hundred students, to be slightly below average. Their 
average college mark was only eighty-two as compared with eighty- 
three for all students, and their average length of attendance only 
three semesters, one less than the average for all. Also in the case 
of those coming from high schools of from 500 to 999, inclusive, a 
similar tendency is shown in both tables, since their average mark was 
eighty-four and their average attendance six semesters. In the cases 
of the other three groups according to size, no consistent departure 
from the general average appears. The apparent conclusion regard- 
ing the matter is, therefore, that on the whole there is little relation- 
ship between size of high school attended and college success as 
measured either by marks received or length of attendance. Students 
from the very smallest high schools are probably at a slight dis- 
advantage, whereas those from high schools of from 500 up to 1000 
students seem to have a very small advantage. 


Taste VIIL—REeEtationsuip BETWEEN CoLLEGE AVERAGE AND SIZE OF 
Hicu ScHoot ATTENDED 


Size of High School 


300-499 


. a 1 5 3 5 4 18 
oh, en 15 56 24 32 49 176 

RS-BEc 78 139 77 97 66 457 
ae ee 88 138 79 85 148 538 
re eee 65 91 38 48 95 337 
i, ee Peeeie 23 25 ar. Te 30 105 
2 eee 10 10 6 9 14 49 
Meee 7 5 2 2 3 
$50.2 ec. 1 
ME sie a. 1 1 
oe em 287 470 244 292 
Median.......... 82 84 84 84 


Taste IX.—ReELaATIONSHIP BETWEEN PERSISTENCE IN COLLEGE AND SIZE OF 
Hiex Scuoot ATTENDED 


Size of High School 
Length of Attendance 
1-99 100-299 300-499 500-99 1000 or more Total 
ik VP. PTAUUIALER Foon oo. se ao ss 58 128 58 104 137 485 
B SMCHTS. 5 cn a5 ese sees 13 27 ; 11 17 40 108 
Gacen: Gropped 4. .cccee se ace 0 0 / 1 1 1 3 
@BDENOER ck «4a s/c erne axe Ca Sate 7 8 8 6 33 
7 sem. dropped............... 2 3 2 2 2 ibe 
eT. PIRUORUER 4, coco ececee : 0 0 1 4 3 8 
GC ReMesteR. cote ws cx ane 16 / 37 23 28 31 135 
i hem. Gropped 06.2) es eee et 1 | 6 1 1 5 14 
Wee eee, ee ee eee 5 ; 7 7 10 ll 40 
5 sem. Cropped... loos sce 242 1 / 4 9 4 1 10 
2 ye. graduates: 8.0.0 25%. ac een 16 29 8 6 33 92 
A ROMCRENS. Ss cw San ose e ese 21 26 : 9 12 17 85 
af near. Cropped. aca sees ok a 16 29 15 14 18 92 
re 2, a eS aaa 35 | 37 23 23 43 161 
S Hel GTOPDEs soccy<cusce ass 2 2 5 3 8 20 
oe, ee ee Se ea 70 101 34 37 59 301 
2 sem. dropped. .............. 5 } 7 3 6 18 39 
1 semester. 6... s.cc.scceren ‘ 23 ~| 30 9 16 32 110 
i sem. dropped: «<< c<<.5s<-0% 10 12 9 5 14 50 
O nementers ba sacnrox an cane ss 14 19 4 13 19 69 
ee i alae 315 512 | ~—(38t 310 | 498 1866 
McHiaS sce uns ueese ee ae 3 sem. 4 sem. 2 yr. grad. 6 sem | 2 yr. grad 2 yr. grad. 


APPENDIX B 


NUMBERS OF SEMESTER HOURS OF DIFFERENT SUB- 
JECTS AND GROUPS OF SUBJECTS CARRIED 
BY FOUR-YEAR STUDENTS 


Although the matter dealt with in this appendix had no integral 
connection with the chief purpose of this study, it seemed to the writer 
that the presentation of figures showing how many semester hours 
of different subjects and groups of subjects were carried by students 
who completed four-year college courses might be of interest to many 
readers. For this reason Table X, which shows this for all subjects 
or groups of subjects carried by twenty-five or more four-year col- 
lege students, is presented. In it figures are given that show how 
many students carried a total of from one to nine, ten to nineteen, 
twenty to twenty-nine, and so on, hours in each subject or subject 
group, and also the total number carrying each subject. As would be 
expected, the largest number of students carried some work in 
English,? almost 90 per cent having done so. Physical education is a 
close second, since about 85 per cent took work in that subject. There 
is a distinct drop to the next subjects, which are education and mathe- 
matics, with less than 70 per cent of all students in each. The only 
subject in which any student carried as much as eighty hours or more 
was music, in which nine individuals did so, and the only others in 
which anyone carried as many as sixty hours were art and design, 
chemistry, English, and law, with no more than six doing so in any 
one of them. In most subjects the majority of students carried less 
than ten hours of work, although in several, more carried from ten to 
nineteen hours, inclusive, than any other number. In none did the 
hours carried by the most students rise above this. 


1The expression “four-year students’ is used to include all those who remained in 
school four years, that is, those included under the headings “four-year graduates, eight 


semesters,’”’ and “eight semesters dropped’’ in previous tables. et 
2The English colada in the table includes rhetoric, although the latter is given sepa- 


rately also, and all other work commonly classified under that general name. 
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